feat: GithubArchive benchmark for nested data #4960

Merged
a10y merged 7 commits into develop from aduffy/gharchive
Oct 16, 2025

a10y (Contributor) commented Oct 15, 2025

GithubArchive is an archival dataset of GitHub event logs, hosted at https://www.gharchive.org/.

It's also distributed as part of RealNest from CWI.

This is our first benchmark to test nested data access with Vortex.
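To make "nested data access" concrete, here is a minimal Python sketch of the kind of lookup such a benchmark exercises. It is an illustration only, not the benchmark's queries and not Vortex's API. The field names (`type`, `actor.login`, `repo.name`, `payload`) follow the GitHub event JSON shape; the sample values and the helpers `get_path` and `count_by` are hypothetical.

```python
import json
from collections import Counter

# Hypothetical sample records shaped like GitHub event JSON (values are invented).
SAMPLE_LINES = [
    '{"type": "PushEvent", "actor": {"login": "alice"}, "repo": {"name": "org/a"}, "payload": {"size": 2}}',
    '{"type": "WatchEvent", "actor": {"login": "bob"}, "repo": {"name": "org/a"}, "payload": {"action": "started"}}',
    '{"type": "PushEvent", "actor": {"login": "alice"}, "repo": {"name": "org/b"}, "payload": {"size": 1}}',
]


def get_path(record, path):
    """Follow a dotted path such as 'actor.login'; return None if any step is missing."""
    current = record
    for key in path.split("."):
        if not isinstance(current, dict) or key not in current:
            return None
        current = current[key]
    return current


def count_by(lines, path):
    """Count events grouped by the value found at a nested path."""
    return Counter(get_path(json.loads(line), path) for line in lines)


# Filter on a top-level field, then group by a nested one.
push_lines = [line for line in SAMPLE_LINES if json.loads(line)["type"] == "PushEvent"]
pushes_by_actor = count_by(push_lines, "actor.login")
```

A columnar format can in principle answer a query like this by reading only the `type` and `actor.login` columns rather than parsing every full record, which is the access pattern a nested-data benchmark is meant to measure.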

I want to get this in before we merge #4942 or any other follow-on work for nested data support.

TODOs:

  • Add more data files. Queries currently finish far too quickly.
  • Add a few more queries


Labels

changelog/feature A new feature
